Systematic Association of Genes to Phenotypes by Genome and Literature Mining

نویسندگان

  • Jan O Korbel
  • Tobias Doerks
  • Lars J Jensen
  • Carolina Perez-Iratxeta
  • Szymon Kaczanowski
  • Sean D Hooper
  • Miguel A Andrade
  • Peer Bork
چکیده

One of the major challenges of functional genomics is to unravel the connection between genotype and phenotype. So far no global analysis has attempted to explore those connections in the light of the large phenotypic variability seen in nature. Here, we use an unsupervised, systematic approach for associating genes and phenotypic characteristics that combines literature mining with comparative genome analysis. We first mine the MEDLINE literature database for terms that reflect phenotypic similarities of species. Subsequently we predict the likely genomic determinants: genes specifically present in the respective genomes. In a global analysis involving 92 prokaryotic genomes we retrieve 323 clusters containing a total of 2,700 significant gene-phenotype associations. Some clusters contain mostly known relationships, such as genes involved in motility or plant degradation, often with additional hypothetical proteins associated with those phenotypes. Other clusters comprise unexpected associations; for example, a group of terms related to food and spoilage is linked to genes predicted to be involved in bacterial food poisoning. Among the clusters, we observe an enrichment of pathogenicity-related associations, suggesting that the approach reveals many novel genes likely to play a role in infectious diseases.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

از ژنوم تا ژن: مروری بر ژن‌ها و تغییرات ژنتیکی موثر بر بروز بیماری دیابت نوع دو

Despite the valuable results achieved in identification of genes and genetic changes associated with type 2 diabetes (T2D), lack of consistency and reproducibility of these results in different populations is one of the challenges lie ahead in introduction of T2D candidate genes. Therefore, the present review article aimed to provide an overview of the most important genes and genetic variation...

متن کامل

Heritability for Stroke: Essential for Taking Family History

 There are many well-established factors that influence the risk of stroke including blood pressure, diabetes, low socioeconomic status and smoking, however, the shared genetic resource in members of a family effect on stroke predisposition. Genome-wide association studies (GWAS) have demonstrated evidence of a shared genetic source in stroke risk. This review considered the influence of family...

متن کامل

The Genetics of Non-Syndromic Primary Ovarian Insufficiency: A Systematic Review

Purpose: Several causes for primary ovarian insufficiency have been described, including iatrogenic and environmental factor, viral infections, chronic disease as well as genetic alterations. Given the large number of genes described in the literature so far, the aim of this review was to collect all the genetic mutations associated with non-syndromic primary ovarian insufficiency. Methods: All...

متن کامل

Computational prediction of miRNAs in Nipah virus genome reveals possible interaction with human genes involved in encephalitis

Current re-emergence of Nipah virus (NiV) in India caused 11 deaths so far and many patients were kept in quarantine. A thorough study of previous outbreaks occurred in Malaysia, Bangladesh and India represents cases with high rate of fatality due to acute encephalitis. Our work involves genome analysis of NiV for prediction of miRNAs and their targeted genes in human in order to understand enc...

متن کامل

P30: Are There Anxious Genes?

Anxiety comprises many clinical descriptions and phenotypes. A genetic predisposition to anxiety is undoubted; however, the nature and extent of that contribution is still unclear. Extensive genetic studies of the serotonin (5-hydroxytryptamine, 5-HT) transporter (5-HTT) gene have revealed how variation in gene expression can be correlated with anxiety phenotypes. Complete genome-wide linkage s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • PLoS Biology

دوره 3  شماره 

صفحات  -

تاریخ انتشار 2005